Effect of Language, Speaking Style and Speaker on Long-Term F0 Estimation

Authors

  • Pablo Arantes
  • Anders Eriksson
  • Suska Gutzeit
Abstract

In this study, we compared three long-term fundamental frequency estimates – mean, median and base value – with respect to how fast they approach a stable value, as a function of language, speaking style and speaker. The base value concept was developed in the search for an f0 value that should be invariant under prosodic variation. It has since also been tested in forensic phonetics as a possible speaker-specific f0 value. The data used in this study, speech by male and female speakers in seven languages and three speaking styles (spontaneous, phrase reading and word list reading), had been recorded for a previous project. Average stabilisation times for the mean, median and base value are 9.76, 9.67 and 8.01 s, respectively. Base values stabilise significantly faster. Languages differ in both the average and the variability of the stabilisation times. Values range from 7.14 to 11.41 s (mean), 7.5 to 11.33 s (median) and 6.74 to 9.34 s (base value). Spontaneous speech yields the most variable stabilisation times for all three estimators in Italian and Swedish, for the median in French and Portuguese, and for the base value in German. Speakers within each language do not differ significantly in terms of stabilisation time variability for the three estimators.
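To make the idea of a stabilisation time concrete, here is a minimal Python sketch for the mean and median estimators. The abstract does not state the stabilisation criterion used in the study, so the rule below is an assumption chosen for illustration only: an estimate counts as stable from the first frame after which its running value stays within a relative tolerance of its final value. The function names, the `rel_tol` parameter and the fixed frame step are hypothetical, and the base value is left out because its computation is not described here.

```python
from bisect import insort


def running_mean(values):
    """Cumulative mean of the f0 samples seen so far, one value per frame."""
    out, total = [], 0.0
    for count, value in enumerate(values, start=1):
        total += value
        out.append(total / count)
    return out


def running_median(values):
    """Cumulative median of the f0 samples seen so far, one value per frame."""
    out, seen = [], []
    for value in values:
        insort(seen, value)  # keep the samples seen so far in sorted order
        n = len(seen)
        mid = n // 2
        out.append(seen[mid] if n % 2 else (seen[mid - 1] + seen[mid]) / 2)
    return out


def stabilisation_time(estimates, frame_step_s, rel_tol=0.01):
    """Time (s) of the first frame after which the running estimate stays
    within rel_tol of its final value.

    ASSUMPTION: this criterion is illustrative, not the one used in the study.
    """
    final = estimates[-1]
    limit = rel_tol * abs(final)
    last_outside = -1
    for index, estimate in enumerate(estimates):
        if abs(estimate - final) > limit:
            last_outside = index
    return (last_outside + 1) * frame_step_s


# Toy f0 track in Hz (made-up numbers), one voiced frame every 0.5 s.
f0_track = [100.0, 140.0, 120.0, 120.0]
mean_time = stabilisation_time(running_mean(f0_track), frame_step_s=0.5)
median_time = stabilisation_time(running_median(f0_track), frame_step_s=0.5)
```

With a criterion of this kind, an estimator that is less sensitive to local excursions in f0 settles into the tolerance band earlier, which is the sort of difference the reported average stabilisation times describe.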

Similar resources

Modeling dynamic prosodic variation for speaker verification

Statistics of frame-level pitch have recently been used in speaker recognition systems with good results [1, 2, 3]. Although they convey useful long-term information about a speaker’s distribution of f0 values, such statistics fail to capture information about local dynamics in intonation that characterize an individual’s speaking style. In this work, we take a first step toward capturing such ...

LR estimation using long term F0 as a parameter: good, bad or useless? Initial investigation using Japanese data

This paper investigates the validity of LR estimation for long-term F0 using Aitkin (1995)’s formula. Although this formula has been developed to estimate the LR of refractive index of glass fragments, previous studies such as Kinoshita (2001) and Rose, Osanai, and Kinoshita (2003) have shown that Aitkin’s formula can be applied to speech data. The experiments in this study revealed, however, t...

MeLos: Analysis and Modelling of Speech Prosody and Speaking Style

This thesis addresses the issue of modelling speech prosody for speech synthesis, and presents MeLos: a complete system for the analysis and modelling of speech prosody “the music of speech”. Research into the analysis and modelling of speech prosody has increased dramatically in recent decades, and speech prosody has emerged as a crucial concern for speech synthesis. The issue of speech prosod...

Between- and Within-Speaker Effects of Bilingualism on F0 Variation

To what extent is prosody shaped by cultural and social factors? Existing research has shown that an individual bilingual speaker exhibits differences in framing, ideology, and personality when speaking their two languages. To understand whether these differences extend to prosody we study F0 variation in a corpus of interviews with German-Italian and German-French bilingual speakers. We find t...

Acoustical Correlates to S Characteristics in Two

For synthesizing voice quality expressed by adjectives, this paper investigates acoustical correlates to adjective ratings of speaker characteristics for reading and conversational speech. The results revealed: (1) The speaking styles have little effect on the rates on adjective scales. (2) The effects of formant frequencies and long-term spectrum to adjective ratings are almost independent of ...

Publication date: 2017